Processing aggregated data: the location of clusters in health data
نویسندگان
چکیده
Spatially aggregated data is frequently used in geographical applications. Often spatial data analysis on aggregated data is performed in the same way as on exact data, which ignores the fact that we do not know the actual locations of the data. We here propose models and methods to take aggregation into account. For this we focus on the problem of locating clusters in aggregated data. More specifically, we study the problem of locating clusters in spatially aggregated health data. The This research was initiated during the GADGET Workshop on Geometric Algorithms and Spatial Data Mining, funded by the Netherlands Organization for Scientific Research (NWO) under BRICKS/FOCUS grant number 642.065.503, and has been also partially funded by NWO under the project GOGO. K. Buchin (B) · M. Buchin Department of Mathematics and Computer Science, TU Eindhoven, Eindhoven, The Netherlands e-mail: [email protected] M. Buchin e-mail: [email protected] M. van Kreveld Department of Computer Science, Utrecht University, Utrecht, The Netherlands e-mail: [email protected] M. Löffler Computer Science Department, University of California, Irvine, CA, USA e-mail: [email protected] J. Luo Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Beijing, China e-mail: [email protected] R. I. Silveira Departament de Matemàtica Aplicada II, Universitat Politècnica de Catalunya, Catalunya, Spain e-mail: [email protected] 498 Geoinformatica (2012) 16:497–521 data is given as a subdivision into regions with two values per region, the number of cases and the size of the population at risk. We formulate the problem as finding a placement of a cluster window of a given shape such that a cluster function depending on the population at risk and the cases is maximized. We propose area-based models to calculate the cases (and the population at risk) within a cluster window. These models are based on the areas of intersection of the cluster window with the regions of the subdivision. We show how to compute a subdivision such that within each cell of the subdivision the areas of intersection are simple functions. We evaluate experimentally how taking aggregation into account influences the location of the clusters found.
منابع مشابه
A Hierarchy Topology Design Using a Hybrid Evolutionary Algorithm in Wireless Sensor Networks
Wireless sensor network a powerful network contains many wireless sensors with limited power resource, data processing, and transmission abilities. Wireless sensor capabilities including computational capacity, radio power, and memory capabilities are much limited. Moreover, to design a hierarchy topology, in addition to energy optimization, find an optimum clusters number and best location of ...
متن کاملتحلیل زمان- مکان موارد سل ریوی در استان همدان با استفاده از خطر قابل انتساب جمعیت
Background and Objectives: One of the applications of population attributable risk percent (PAR%) is to estimate the disease burden in a population exposed to several risk factors. Therefore, this study was conducted to estimates the PAR% of the space-time clusters of pulmonary tuberculosis. Methods: In this study, the data of pulmonary TB cases were obtained from the health department of Ha...
متن کاملDetermination of Cluster Hydrodynamics in Bubbling Fluidized Beds by the EMMS Approach
The local solid flow structure of gas-solid bubbling fluidized bed was investigated to identify and characterize the particle clusters. Extensive mathematical calculations were carried out using the energy-minimization multi-scale (EMMS) approach for evaluating cluster properties including the velocity, the size and the void fraction of clusters in the dense phase of the bed. The results showed...
متن کاملA Clustering Based Location-allocation Problem Considering Transportation Costs and Statistical Properties (RESEARCH NOTE)
Cluster analysis is a useful technique in multivariate statistical analysis. Different types of hierarchical cluster analysis and K-means have been used for data analysis in previous studies. However, the K-means algorithm can be improved using some metaheuristics algorithms. In this study, we propose simulated annealing based algorithm for K-means in the clustering analysis which we refer it a...
متن کاملUsing data envelopment analysis (DEA) to improve the sales performance in Iranian agricultural clusters by utilizing business networks and business development services providers (BDSPs)
Business clusters play an important role in developing and improving the economic performance of countries and in promoting the welfare of people. Business development service providers (hereafter referred to as, BDSP) have a considerable role in providing specialized services pertinent to the conditions of active enterprises in clusters and in promoting their performance level in order to impr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- GeoInformatica
دوره 16 شماره
صفحات -
تاریخ انتشار 2012